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What is claimed is ; 

i. A method of evaluating characters in a message to generate a search index, 
contorising the steps of: 

a) accepting an input of the characters of the message; 

b) \ evaluating the message by comparing the characters of the message to 
a predetemiined^et of candidate character sets to determine a match between the 
predetermined set ofxandidate character sets and the message; and 

c) generatmg a search index based on the results of the evaluation of the 
message and candidate chapter sets. 

QiQ> ^N. method of claim 1, wherein the comparing of step (b) comprises the step 

of cm^^mring each character of the message to an entry for each of the candidate 
character sels4n a character table bank. 

3. The method of claim 2, wherein the step of comparing each character 
comprises the step of testing the ability of each candidate character set to express that 
character by performing a logical mask between a universal code for that character 
and an indicator in the character table bank indicating whether each of the candidate 
character sets contains that character. 

4. The method of claim 3, wherein the universal code is Unicode. 

5. The method of claim 1 , further comprising the step of (d) computing a total 
number of characters matched to each of the candidate character sets. 

6. The method of claim 1, further comprising the step of (e) selecting a best 
match based upon the total number of characters matched to each of the candidate 
character sets. 

7. The method of claim 1 , further comprising the step of (f) evaluating the 
characters of a query string. 

8. The method of claim 7, further comprising the step of (g) performing a search 
of the query string against search indices whose character set matches the character set 
of the query string. 
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^\i£> C)^^ \ ^ system for evaluating characters in a message to generate a search index, 
con^rising: 

input interface to accept an input of the characters of the message; and 
a processor unit, connected to the input interface, the processor unit evaluating 
the message by comparing the characters of the message to a predetermined set of 
candidate charact^ sets to determine a match between the predetermined set of 
candidate character sights and the message, and generating a search index based on the 
results of the evaluatioirgf the message and candidate character sets. 
^0 (3 The system of claim 9, wherein the processor unit compares each character of 

the mfe$§age to an entry for each of the candidate character sets in a character table 
bank. 

1 1 . The system of claim 10, wherein the processor unit tests the ability of each 
candidate character set to express that character by performing a logical mask between 
a universal code for that character and an indicator in the character table bank 
indicating whether each of the candidate character sets contains that character. 

12. The system of claim 1 1, wherein the universal code is Unicode. 

13. The system of claim 9, wherein the processor unit computes a total nimiber of 
characters matched to each of the candidate character sets. 

14. The system of claim 9, wherein the processor imit selects a best match based 
upon the total number of characters matched to each of the candidate character sets. 

1 5. The system of claim 9, wherein the processor imit evaluates the characters of a 
query string. 

16. The system of claim 15, wherein the processor unit perfomis a search of the 
query string against search indices whose character set matches the character set of the 
query string. 

^vi& €>S^ A system for evaluating characters in a message to generate a search index, 

comprising: 

ijj^Ht interface means to accept an input of the characters of the message; and 
processor means, connected to the input interface means, the processor means 
evaluating the menage by comparing the characters of the message to a 
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prg^determined set of candidate character sets to determine a match between the 
predetOTnined set of candidate character sets and the message, and generating a search 
index basisd on the results of the evaluation of the message and candidate character 
sets. 

The system of claim 17, wherein the processor means compares each character 
of the^essage to an entry for each of the candidate character sets in a character table 
bank. 

19. The system of claim 1 8, wherein the processor means tests the ability of each 
candidate character set to express that character by performing a logical mask between 
a universal code for that character and an indicator in the character table bank 
indicating whether each of the candidate character sets contains that character. 

20. The system of claim 19, wherein the universal code is Unicode. 

21 . The system of claim 17, wherein the processor means computes a total number 
of characters matched to each of the candidate character sets. 

22. The system of claim 17, wherein the processor means selects a best match 
based upon the total number of characters matched to each of the candidate character 
sets. 

23. The system of claim 17, wherein the processor means evaluates the characters 
of a query string. 

24. The system of claim 23, wherein the processor means performs a search of the 
query string against search indices whose character set matches the character set of the 
query string. 

A storage medium for storing machine readable code, the machine readable 
co<ibJ)eing executable to evaluate characters in an electronic message according to the 
steps ol 

a) \ accepting an input of the characters of the message; 

b) \evaluating the message by comparing the characters of the message to 
a predetermined set of candidate character sets to determine a match between the 
predetermined set of candidate character sets and the message; and 
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c),^ generating a search index based on the results of the evaluation of the 
message andsfhe candidate character sets. 

The storage medium of claim 25, wherein the comparing of step (b) comprises 
the s^to of comparing each character of the message to an entry to each of the 
candidate^^iaracter sets in a character table bank. 

27. The storage mediimi of claim 26, wherein the step of comparing each character 
comprises the step of testing the ability of each candidate character set to express that 
character by performing a logical mask between a vmiversal code for that character 
and an indicator in the character table bank indicating whether each of the candidate 
character sets contains that character. 

28. The storage medium of claim 27, wherein the imiversal code is Unicode. 

29. The storage medium of claim 25, wherein the steps further comprise the step 
of (d) computing a tot^ nimiber of characters matched to each of the candidate 
character sets. 

30. The storage medium of claim 25, wherein the steps further comprise the step 
of (e) selecting a best match based upon the total niunber of characters matched to 
each of the candidate character sets. 

3 1 . The storage medium of claim 25, wherein the steps further comprise the step 
of (f) evaluating the characters of a query string. 

32. The storage medium of claim 31, wherein the steps further comprise the step 
of (g) performing a search of the query string against search indices whose character 
set matches the character set of the query string. 



